Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
نویسندگان
چکیده
To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as "winner's curse." We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10-6) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures.
منابع مشابه
Evaluating basic approaches to post-hoc analysis for commonly used, gene-based rare variant tests of association
متن کامل
Association between bovine lactoferrin gene variant and somatic cell count in milk based on EcoRI restriction site
Mastitis is one of the most serious and costly diseases affecting dairy cattle production. In the present study, effects of a lactoferrin gene polymorphism (intron 6) on milk somatic cell count (SCC) and subclinical mastitis was investigated in 121 Holstein dairy cattle. Two alleles of A and B and two genotypes of AA and AB were found in an EcoRI recognized single nucleotide polymorphism in int...
متن کاملIdentification of a Novel Splice Site Mutation in RUNX2 Gene in a Family with Rare Autosomal Dominant Cleidocranial Dysplasia
Introduction: Pathogenic variants of RUNX2, a gene that encodes an osteoblast-specific transcription factor, have been shown as the cause of CCD, which is a rare hereditary skeletal and dental disorder with dominant mode of inheritance and a broad range of clinical variability. Due to the relative lack of clinical complications resulting in CCD, the medical diagnosis of this disorder is challen...
متن کاملP-215: Discovery of A Novel APA Variant of A Human Potential Gene Based on Expressed Sequenced Tags Analysis
Background: Expressed sequence tags (ESTs) are sequences of cDNA fragments prepared from different tissue sources. There are over one million of these sequences in the publicly available database, and these sequences are believed to represent more than half of all human genes. The ESTs belong to different cDNA libraries, was prepared from one particular cell type, organ, or tumor. Therefore, th...
متن کاملHemoglobin Daneshgah-Tehran (HBA1:c.218A>G p.His72Arg): a Rare α1-Globin Variant from Iran
There are more than 400 different variations on α-globin protein, and most of them are not associated with noticeable clinical manifestation. Hemoglobin (Hb) is an oxygen-transporting protein and Hb Daneshgah- Tehran is an α-globin variant that for the first time was reported from Iran in a case with normal haematological indices. The capillary electrophoresis of an 8-year- old-girl with normal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 8 شماره
صفحات -
تاریخ انتشار 2017